Loss Surfaces, Mode Connectivity, and Fast Ensembling of DNNs

Authors

  • Timur Garipov
  • Pavel Izmailov
  • Dmitrii Podoprikhin
  • Dmitry P. Vetrov
  • Andrew Gordon Wilson
Abstract

The loss functions of deep neural networks are complex and their geometric properties are not well understood. We show that the optima of these complex loss functions are in fact connected by a simple polygonal chain with only one bend, over which training and test accuracy are nearly constant. We introduce a training procedure to discover these high-accuracy pathways between modes. Inspired by this new geometric insight, we also propose a new ensembling method entitled Fast Geometric Ensembling (FGE). Using FGE we can train high-performing ensembles in the time required to train a single model. We achieve improved performance compared to the recent state-of-the-art Snapshot Ensembles, on CIFAR-10 and CIFAR-100, using state-of-the-art deep residual networks. On ImageNet we improve the top-1 error rate of a pre-trained ResNet by 0.56% by running FGE for just 5 epochs.
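The "polygonal chain with only one bend" mentioned in the abstract can be pictured as two straight segments in weight space that meet at a single bend point. The following is a minimal sketch, assuming flattened weight vectors and a piecewise-linear parameterization over t in [0, 1]; the function name `polychain_point`, the toy vectors, and the exact parameterization are illustrative assumptions, not taken from the abstract. The sketch only evaluates points on such a chain; it does not implement the training procedure that finds the bend.

```python
import numpy as np

def polychain_point(w1, w2, theta, t):
    """Return the point at position t in [0, 1] on the chain w1 -> theta -> w2.

    w1, w2 : flattened weights of the two endpoint modes (hypothetical inputs)
    theta  : the single bend point of the chain (hypothetical input)
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    if t <= 0.5:
        s = 2.0 * t                      # rescale [0, 0.5] to [0, 1]
        return (1.0 - s) * w1 + s * theta
    s = 2.0 * (t - 0.5)                  # rescale [0.5, 1] to [0, 1]
    return (1.0 - s) * theta + s * w2

# Toy 2-D "weights", chosen only for illustration.
w1 = np.array([0.0, 0.0])
w2 = np.array([2.0, 0.0])
theta = np.array([1.0, 1.0])

# Five evenly spaced points along the chain, endpoints included.
path = [polychain_point(w1, w2, theta, t) for t in np.linspace(0.0, 1.0, 5)]
```

In practice one would load each point on the path into a network and measure its training and test accuracy; the abstract's claim is that these stay nearly constant along a suitably chosen chain.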


Related resources

Two-Surfaces Sliding Mode Controller for Energy Management of Electric Vehicle Based on Multi Input DC-DC Converter

In this paper, a two-surfaces sliding mode controller (TSSMC) is proposed for the voltage tracking control of a two input DC-DC converter in application of electric vehicles (EVs). The imperialist competitive algorithm (ICA) is used for tuning TSSMC parameters. The proposed controller significantly improves the transient response and disturbance rejection of the two input converters while p...


Application of Habitat Suitability and Resistance Surfaces in Assessing Habitat Changes

Habitats have been dramatically destroyed worldwide. However, there is a growing trend toward restoring habitats. One of the most effective approaches to revitalizing them is to restore the conditions that have been lost. Studies indicate a high probability of local extinction of Maral (Cervus elaphus maral) in its current habitats in Gilan due to severe habitat destruction. The current study aimed to introdu...


Fast and Accurate Inference with Adaptive Ensemble Prediction in Image Classification with Deep Neural Networks

Ensembling multiple predictions is a widely used technique to improve the accuracy of various machine learning tasks. In image classification tasks, for example, averaging the predictions for multiple patches extracted from the input image significantly improves accuracy. Using multiple networks trained independently to make predictions improves accuracy further. One obvious drawback of the ens...


ENERGY AWARE DISTRIBUTED PARTITIONING DETECTION AND CONNECTIVITY RESTORATION ALGORITHM IN WIRELESS SENSOR NETWORKS

 Mobile sensor networks rely heavily on inter-sensor connectivity for collection of data. Nodes in these networks monitor different regions of an area of interest and collectively present a global overview of some monitored activities or phenomena. A failure of a sensor leads to loss of connectivity and may cause partitioning of the network into disjoint segments. A number of approaches have be...


Design of a Novel Framework to Control Nonlinear Affine Systems Based on Fast Terminal Sliding-Mode Controller

In this paper, a novel approach for finite-time stabilization of uncertain affine systems is proposed. In the proposed approach, a fast terminal sliding mode (FTSM) controller is designed, based on the input-output feedback linearization of the nonlinear system while taking its internal dynamics into account. One of the main advantages of the proposed approach is that only the outputs and external state...



Journal:
  • CoRR

Volume abs/1802.10026  Issue 

Pages  -

Publication date 2018